Semantic processor for knowledge extraction from texts in Russian and English
Abstract
The paper presents an approach to automatically extracting knowledge from natural language texts (Russian and English) and building a Knowledge Base from it. The approach is used to solve the most complex problems of linguistic processors and logical-analytical systems. For this purpose, a means of knowledge representation (extended semantic networks, ESN) and a tool for processing them (the logic programming language DECL) have been designed. On this basis, universal syntactic-semantic rules and ontologies have been proposed; they encode the universal linguistic knowledge needed for knowledge extraction and have been used to build many intelligent systems for different applications.
Similar resources
Linguistic Processor Semantix for Knowledge Extraction from Natural Texts in Russian and English
The linguistic processor Semantix is intended for the areas where the automatic formalization of the flows of texts in natural language is required: resume, mass media issues, information and advertising materials, mail communications, summaries of incidents, information in the criminal cases, archive materials and other texts. The objects interesting for a user are extracted from documents wit...
Presenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
Intelligent System for Entities Extractions (ISEE) from Natural Language Texts
This paper describes a semantic linguistic processor which extracts the entities and their links from natural language texts. The conceptual model underlying the algorithmic developments is the extended semantic networks (ESN). This paper analyzes the use of the processor for text formalization in various subject fields: economy monitoring, criminal actions, mass media, terrorist activities (in...
Knowledge-Driven Event Extraction in Russian: Corpus-Based Linguistic Resources
Automatic event extraction from text is an important step in knowledge acquisition and knowledge base population. Manual work in the development of an extraction system is indispensable, either in corpus annotation or in creating vocabularies and patterns for a knowledge-based system. Recent works have focused on adapting existing systems (for extraction from English texts) to new domains. Even...
Syntactic Complexity of Russian Unified State Exam Texts in English: A Study on Reliability and Validity
In this study we analyze texts used in the Russian Unified State Exam (USE) in English. The texts, which form two small research corpora, were retrieved from 2 resources: the official USE database as a reference point, and “Neznaika” (https://neznaika.pro/), a popular website used by pupils for USE training. The two corpora are balanced in size: USE has 11934 tokens and “Neznaika” has 11918 tokens. We share Biber’...
Publication date: 2013